AITopics | materials science

Collaborating Authors

materials science

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model

Neural Information Processing SystemsMar-20-2026, 23:30:05 GMT

Knowledge in materials science is widely dispersed across extensive scientific literature, posing significant challenges for efficient discovery and integration of new materials. Traditional methods, often reliant on costly and time-consuming experimental approaches, further complicate rapid innovation. Addressing these challenges, the integration of artificial intelligence with materials science has opened avenues for accelerating the discovery process, though it also demands precise annotation, data extraction, and traceability of information. To tackle these issues, this article introduces the Materials Knowledge Graph (MKG), which utilizes advanced natural language processing techniques, integrated with large language models to extract and systematically organize a decade's worth of high-quality research into structured triples, contains 162,605 nodes and 731,772 edges. MKG categorizes information into comprehensive labels such as Name, Formula, and Application, structured around a meticulously designed ontology, thus enhancing data usability and integration. By implementing network-based algorithms, MKG not only facilitates efficient link prediction but also significantly reduces reliance on traditional experimental methods. This structured approach not only streamlines materials research but also lays the groundwork for more sophisticated materials knowledge graphs.

artificial intelligence, large language model, natural language, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.30)

Add feedback

Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer Namkyeong Lee

Neural Information Processing SystemsFeb-16-2026, 22:48:08 GMT

That is, DOS is not solely determined by the crystalline material but also by the energy levels, which has been neglected in previous works. In this paper, we propose to integrate heterogeneous information obtained from the crystalline materials and the energies via a multi-modal transformer, thereby modeling the complex relationships between the atoms in the crystalline materials and various energy levels for DOS prediction. Moreover, we propose to utilize prompts to guide the model to learn the crystal structural system-specific interactions between crystalline materials and energies. Extensive experiments on two types of DOS, i.e., Phonon DOS and Electron DOS, with various real-world scenarios demonstrate the superiority of DOST ransformer .

information, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > Middle East > Israel (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)

Industry:

Energy (0.93)
Health & Medicine (0.67)
Materials (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Vision (0.68)

Add feedback

680da2fd0331deecc2e5b7cf0e55e832-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 13:22:50 GMT

information, large language model, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > New South Wales (0.04)
Asia > China > Hong Kong (0.04)
North America > United States (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Energy > Renewable (0.68)
Energy > Energy Storage (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
(2 more...)

Add feedback

AI materials discovery now needs to move into the real world

MIT Technology ReviewDec-15-2025, 10:00:00 GMT

Startups flush with cash are building AI-assisted laboratories to find materials far faster and more cheaply, but are still waiting for their ChatGPT moment. The microwave-size instrument at Lila Sciences in Cambridge, Massachusetts, doesn't look all that different from others that I've seen in state-of-the-art materials labs. Inside its vacuum chamber, the machine zaps a palette of different elements to create vaporized particles, which then fly through the chamber and land to create a thin film, using a technique called sputtering. What sets this instrument apart is that artificial intelligence is running the experiment; an AI agent, trained on vast amounts of scientific literature and data, has determined the recipe and is varying the combination of elements. Later, a person will walk the samples, each containing multiple potential catalysts, over to a different part of the lab for testing. Another AI agent will scan and interpret the data, using it to suggest another round of experiments to try to optimize the materials' performance. For now, a human scientist keeps a close eye on the experiments and will approve the next steps on the basis of the AI's suggestions and the test results. But the startup is convinced this AI-controlled machine is a peek into the future of materials discovery--one in which autonomous labs could make it far cheaper and faster to come up with novel and useful compounds. Flush with hundreds of millions of dollars in new funding, Lila Sciences is one of AI's latest unicorns.

discovery, new material, scientist, (16 more...)

MIT Technology Review

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.24)
North America > United States > New York (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry: Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications > Social Media (0.95)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)

Add feedback

Human-in-the-Loop and AI: Crowdsourcing Metadata Vocabulary for Materials Science

Greenberg, Jane, McClellan, Scott, Ireland, Addy, Sammarco, Robert, Gerber, Colton, Rauch, Christopher B., Kelly, Mat, Kunze, John, An, Yuan, Toberer, Eric

arXiv.org Artificial IntelligenceDec-11-2025

Metadata vocabularies are essential for advancing FAIR and FARR data principles, but their development constrained by limited human resources and inconsistent standardization practices. This paper introduces MatSci-YAMZ, a platform that integrates artificial intelligence (AI) and human-in-the-loop (HILT), including crowdsourcing, to support metadata vocabulary development. The paper reports on a proof-of-concept use case evaluating the AI-HILT model in materials science, a highly interdisciplinary domain Six (6) participants affiliated with the NSF Institute for Data-Driven Dynamical Design (ID4) engaged with the MatSci-YAMZ plaform over several weeks, contributing term definitions and providing examples to prompt the AI-definitions refinement. Nineteen (19) AI-generated definitions were successfully created, with iterative feedback loops demonstrating the feasibility of AI-HILT refinement. Findings confirm the feasibility AI-HILT model highlighting 1) a successful proof of concept, 2) alignment with FAIR and open-science principles, 3) a research protocol to guide future studies, and 4) the potential for scalability across domains. Overall, MatSci-YAMZ's underlying model has the capacity to enhance semantic transparency and reduce time required for consensus building and metadata vocabulary development.

artificial intelligence, social media, vocabulary development, (15 more...)

arXiv.org Artificial Intelligence

2512.09895

Country: North America > United States (1.00)

Genre:

Research Report > New Finding (0.48)
Research Report > Experimental Study (0.46)

Industry: Education > Educational Setting (0.60)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.66)

Add feedback

Training-Free Active Learning Framework in Materials Science with Large Language Models

Wang, Hongchen, Castañeda, Rafael Espinosa, Werber, Jay R., Fehlis, Yao, Kim, Edward, Hattrick-Simpers, Jason

arXiv.org Artificial IntelligenceDec-5-2025

Active learning (AL) accelerates scientific discovery by prioritizing the most informative experiments, but traditional machine learning (ML) models used in AL suffer from cold-start limitations and domain-specific feature engineering, restricting their generalizability. Large language models (LLMs) offer a new paradigm by leveraging their pretrained knowledge and universal token-based representations to propose experiments directly from text-based descriptions. Here, we introduce an LLM-based active learning framework (LLM-AL) that operates in an iterative few-shot setting and benchmark it against conventional ML models across four diverse materials science datasets. We explored two prompting strategies: one using concise numerical inputs suited for datasets with more compositional and structured features, and another using expanded descriptive text suited for datasets with more experimental and procedural features to provide additional context. Across all datasets, LLM-AL could reduce the number of experiments needed to reach top-performing candidates by over 70% and consistently outperformed traditional ML models. We found that LLM-AL performs broader and more exploratory searches while still reaching the optima with fewer iterations. We further examined the stability boundaries of LLM-AL given the inherent non-determinism of LLMs and found its performance to be broadly consistent across runs, within the variability range typically observed for traditional ML approaches. These results demonstrate that LLM-AL can serve as a generalizable alternative to conventional AL pipelines for more efficient and interpretable experiment selection and potential LLM-driven autonomous discovery.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.1973

Country:

North America > United States (0.28)
Europe > Austria (0.28)
North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report > New Finding (1.00)

Industry: Materials (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

CodeDistiller: Automatically Generating Code Libraries for Scientific Coding Agents

Jansen, Peter, Hassan, Samiah, Narasimha, Pragnya

arXiv.org Artificial IntelligenceDec-2-2025

Automated Scientific Discovery (ASD) systems can help automatically generate and run code-based experiments, but their capabilities are limited by the code they can reliably generate from parametric knowledge alone. As a result, current systems either mutate a small number of manually-crafted experiment examples, or operate solely from parametric knowledge, limiting quality and reach. We introduce CodeDistiller, a system that automatically distills large collections of scientific Github repositories into a vetted library of working domain-specific code examples, allowing ASD agents to expand their capabilities without manual effort. Using a combination of automatic and domain-expert evaluation on 250 materials science repositories, we find the best model is capable of producing functional examples for 74% of repositories, while our downstream evaluation shows an ASD agent augmented with a CodeDistiller generated library produces more accurate, complete, and scientifically sound experiments than an agent with only general materials-science code examples.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2512.01089

Country:

North America > United States (0.94)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.82)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)

Add feedback

Generative models for crystalline materials

Metni, Houssam, Ruple, Laura, Walters, Lauren N., Torresi, Luca, Teufel, Jonas, Schopmans, Henrik, Östreicher, Jona, Zhang, Yumeng, Neubert, Marlen, Koide, Yuri, Steiner, Kevin, Link, Paul, Bär, Lukas, Petrova, Mariana, Ceder, Gerbrand, Friederich, Pascal

arXiv.org Artificial IntelligenceDec-1-2025

Understanding structure-property relationships in materials is fundamental in condensed matter physics and materials science. Over the past few years, machine learning (ML) has emerged as a powerful tool for advancing this understanding and accelerating materials discovery. Early ML approaches primarily focused on constructing and screening large material spaces to identify promising candidates for various applications. More recently, research efforts have increasingly shifted toward generating crystal structures using end-to-end generative models. This review analyzes the current state of generative modeling for crystal structure prediction and \textit{de novo} generation. It examines crystal representations, outlines the generative models used to design crystal structures, and evaluates their respective strengths and limitations. Furthermore, the review highlights experimental considerations for evaluating generated structures and provides recommendations for suitable existing software tools. Emerging topics, such as modeling disorder and defects, integration in advanced characterization, and incorporating synthetic feasibility constraints, are explored. Ultimately, this work aims to inform both experimental scientists looking to adapt suitable ML models to their specific circumstances and ML specialists seeking to understand the unique challenges related to inverse materials design and discovery.

evolutionary algorithm, large language model, machine learning, (23 more...)

arXiv.org Artificial Intelligence

2511.22652

Country: North America > United States > California (0.28)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Materials (1.00)
Energy (1.00)
Government > Regional Government (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Why Physics Still Matters: Improving Machine Learning Prediction of Material Properties with Phonon-Informed Datasets

Benítez, Pol, López, Cibrán, Saucedo, Edgardo, Mizoguchi, Teruyasu, Cazorla, Claudio

arXiv.org Artificial IntelligenceNov-20-2025

Machine learning (ML) methods have become powerful tools for predicting material properties with near first-principles accuracy and vastly reduced computational cost. However, the performance of ML models critically depends on the quality, size, and diversity of the training dataset. In materials science, this dependence is particularly important for learning from low-symmetry atomistic configurations that capture thermal excitations, structural defects, and chemical disorder, features that are ubiquitous in real materials but underrepresented in most datasets. The absence of systematic strategies for generating representative training data may therefore limit the predictive power of ML models in technologically critical fields such as energy conversion and photonics. In this work, we assess the effectiveness of graph neural network (GNN) models trained on two fundamentally different types of datasets: one composed of randomly generated atomic configurations and another constructed using physically informed sampling based on lattice vibrations. As a case study, we address the challenging task of predicting electronic and mechanical properties of a prototypical family of optoelectronic materials under realistic finite-temperature conditions. We find that the phonons-informed model consistently outperforms the randomly trained counterpart, despite relying on fewer data points. Explainability analyses further reveal that high-performing models assign greater weight to chemically meaningful bonds that control property variations, underscoring the importance of physically guided data generation. Overall, this work demonstrates that larger datasets do not necessarily yield better GNN predictive models and introduces a simple and general strategy for efficiently constructing high-quality training data in materials informatics.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2511.15222

Country: Europe > Spain > Catalonia (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Energy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Large language models in materials science and the need for open-source approaches

Yang, Fengxu, Chen, Weitong, Evans, Jack D.

arXiv.org Artificial IntelligenceNov-17-2025

Large language models (LLMs) are rapidly transforming materials science. This review examines recent LLM applications across the materials discovery pipeline, focusing on three key areas: mining scientific literature , predictive modelling, and multi-agent experimental systems. We highlight how LLMs extract valuable information such as synthesis conditions from text, learn structure-property relationships, and can coordinate agentic systems integrating computational tools and laboratory automation. While progress has been largely dependent on closed-source commercial models, our benchmark results demonstrate that open-source alternatives can match performance while offering greater transparency, reproducibility, cost-effectiveness, and data privacy. As open-source models continue to improve, we advocate their broader adoption to build accessible, flexible, and community-driven AI platforms for scientific discovery.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2511.10673

Genre: Research Report > New Finding (0.88)

Industry: Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback